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ON REDUCIBILITY AMONG COMBINATORIAL PROBLEMS 



Lems 



Abstract : A large class of combinatorial problt 
have been shown by Cook and Karp to be computationally 
equivalent to within a polynomial. We exhibit some 
new problems in this class, and provide simpler 
proofs for some of the known reductions. 



CHAPTER 1: THE PROBLEM. TERMINOLOGY. 



(1. 1) The Problem. 

(For the terminology see (1.3).) 

In 1970 S.A. Cook stated the problem whether a deterministic T.M. 
(Turing machine) can do in polynomial time what a nondeterministic one 
can do in polynomial time. His conjecture which has gained support since 
then is that the answer is negative. He showed that every language 
recognition problem (i.e. every decision problem) solvable in polynomial 
time by a nondeterministic T.M (we call this class of problems {MP ) 
can be reduced in polynomial time to the satisfiability problem (i.e. 
whether for a given propositional formula there is a truth assignment 
making it true) and also to a few other problems in Wr ([!])• We call 
such problems polynomial-complete in IN P or, for short, p-complete. 

In 1972, R.M. Karp listed 21 decision problems and showed that they 
are p-complete ([7]). In the meantime the family of known p-complete 
problems has been extended by S. Sahni ( CS Dept. , Cornell Univ.) and 
others. 

Cook's problem has become an important issue in complexity theory 
([10]). Its solution would provide some important clues about how to 
approach a large class of notoriously intractable computational problems 
in mathematical programming, operations research, theorem proving, and 
related areas. 
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(1.2) Comment . 

A general characterization of p-complete problems that allows one 

to decide algorithmically whether a combinatorial decision problem is 

p-complete or solvable in polynomial time w.r.t. (with respect to) the 

input lenght would be very desirable. There are deceptive relations 

between combinatorial problems. For example, a family {S.}, i£ (l,2,...,m], 

of finite subsets of some domain has a system of distinct representatives 

iff for every finite k and any choice of k distinct indices i,, ^o'' - '^' 

the subsets S. , S ,...,S. contain between them at least k distinct 

X l X 2 L k 
elements. One might be tempted to guess that the problem can only be 

decided by an exhaustive search. But Marshall Hall discovered an algorithm 

for its solution that takes only polynomial time ([6]). 

We started this work looking for some characterization of p-complete 

problems as mentioned above. The goal has not been achieved. In Chapter 2 

and 3 we are going to report on the by-products of our efforts. Both 

chapters have introductory sections. 

(1.3) Terminology. 

We assume the reader is familiar with the papers by Cook and Karp 
mentioned in (1.1). 

We are considering decision problems and always assume that they are 
given in some encoded form over a finite alphabet. So they become language 
recognition problems. 

Let P be a decision problem and L be some representation (encoding) 
of P over a finite alphabet S. We have L c 2T ) . Every element of L 



) IT denotes the set of all finite strings that can be formed by 
elements of S, including the empty string. 



represents an instance of P. - Assume P' is a special case (a subproblem) 
of P, that means using the same encoding as for P, P' is represented by a 
language L' and L' <= l. Then we call P' a contraction of P. - Analogously, 
if P" is a generalisation of P, that means using the same encoding as for 
P, P" is represented by a language L" and L" => L, then we call P" an 
extension of P. 

The class of problems solvable in polynomial time by a nondeterministic 
T.M. is denoted by Wr , the class of problems solvable in polynomial 
time by a deterministic T.M. by P . The time is always considered to be 
a function of the input length. We are only dealing with one-tape T.M. 's. 

Given a problem P 1 as a language 1^ over a finite alphabet ZL, and 
a problem P 2 as a language L 2 over a finite alphabet ZL. Assume there is a 
pping f: 1^ -» Z^ s.t. for every w £ Z^, f(w) e L« iff w e L f is 
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called a reduction from P 1 to P_. If there is a deterministic T.M. that 
performs the mapping f in polynomial time, then we say P. is Karp-reducible, 
or K-reducible to P„. For that we use the notation 

P x fc— ► P 2 . 
Some standard encodings of the problems involved are implicit in our use 
of this notation. 

Cook's notion of reducibility is different. Given two decision problems 
P and Q. Assume we have an oracle for solving Q in time 1. Then, according 
to Cook, P is reducible to Q in polynomial time iff there is an algorithm 
which decides P and may use the oracle for Q (more than once) and runs in 
polynomial time. Then we say, P is Cook-reducible, or C-reducible to Q, 
and we use the notation 

P G — *■ Q . 
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Obviously, when P K— »■ Q, then P G— *■ Q. But it is not known whether 
C-reducibility implies K-reducibi lity. Elsewhere, when we write 'P is 
reducible to Q ' , we mean K-reducibility. 

If s is a finite string of literals, we denote the length of s, i.e. 
the number of literals in s, by v(s). 

Assume a decision problem P.. is K-reducible to P 2 - Assume T^ is given 
as a language L. over a finite alphabet £. , i = 1, 2. Assume the reduction 
f : X^ -» lit is performed by a T.M. M. Let T(w) be the time M takes to 
compute f(w), and let cr(w) be v(f(w)). We define 

t(n) = max { t(w): v(w) <■ n } 
and call t(n) the reduction time of f (w.r.t. M) . 

We define 

s(n) = max { cr(w) : v(w) ^ n } 
and call s(n) the reduction space of f (w.r.t. M) . - Observe that s(n) is 
not the work space needed by M to perform f. 

Whenever we refer to time or space of a reduction we mean the reduction 
time or the reduction space, resp., as defined above. 

If t(n) is linear (quadratic), we say f is time-linear (time-quadratic). 
If s(n) is linear (quadratic), we say f is space-linear (space-quadratic). 

Assume the reduction P.. K — *■ P is such that f restricted to L^ is a 
bijective mapping onto L„, and f gives a reduction P„ K— *■ P^. Then we 
call P- isomorphic to P ? w.r.t. f. 

Assume there is a problem P eW s.t. for every Q el/r there is a 

reduction Q K > P. Then we call P K-p-complete in Iklf , or sometimes 

just p-complete . - If there is a problem P' e IN P s.t. for every Q S l/P 
there is a reduction Q G >■ P 1 , we call P' C-p-complete (in A/ f ). 



We are going to use Karp's names for the p-complete problems he listed 
in [7], except we call SUBSET SUM what he calls KNAPSACK. These names will 
be written in capitals. - A p-complete contraction of SAT is the one where 
we confine ourselves to propositional formulas in conjunctive form. We call 
this problem CF-SAT. - CF-SAT with at most three literals per clause or 
exactly three literals per clause is also p-complete. We refer to both cases 
by 3-CF-SAT and specify in the context which one we mean. 

A network is a graph with weighted arcs, where the weights are non- 
negative integers. The network may have sources with specified maximum 
(integer) supplies and sinks with specified minimum (integer) demands. 

(1.4) Remark. 

Though other parameters are sometimes used, it is common to give the 
computational complexity (in time and space) of a problem as a function 
of the input length v(w), where w is the input, i.e. an encoding of the 
given instance of the problem. Hence, this computational complexity depends 
on the encoding and thus on the alphabet Z) chosen. However, it turns out 
that within a range of encodings for a given problem, encodings that we 
informally would like to call 'standard encodings' (cf. (3.2)), the 
computational complexity is only slightly affected when the encoding 
is being changed. If the computational complexity is of some polynomial 
degree d, a change of the encoding may lead to a polynomial degree of 
2d or 3d. But a change of the encoding will not result in converting a 
complexity that is of higher than polynomial degree into a complexity 
of polynomial degree, and vice versa. In this sense our different encodings 
are invariant. The only exception to this remark is the difference between 
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unary ( or tally ) and radix notation for integers which may differ 
exponentially. We shall always assume integers are represented in radix 
notation, say base two for definiteness. 

We doubt that the length parameter v is always the most suitable one 
for expressing the computational complexity of the problems with which 
we are dealing. Probably, a parameter or a vector of parameters closer to 
the logical structure of our decision problems would be more revealing 
in particular cases. But length will be our main parameter in order to 
maintain comparability among the several differently structured problems 
we consider. We do not know a better one for this purpose. 
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Do this for every clause and form the union of all the unions (i) . Let 

1 
3" 



1 . *■ -L, 

B denote this union. Then, A is satisfiable iff B is --satisfiable. 



(2. 2) NWF with two flow intervals. 

We are going to consider network flows (NWF) . Always assume there is 
one source s and one sink t in the network. All pertinent data are supposed 
to be integer valued. 

Assume for a flow f. along Arc i we have the constraints s f ^ a 

where a is a nonnegative integer. There is an algorithm by Ford-Fulkerson- 

1 

Edmonds to determine the maximal feasible flow in the network. The 

3 
computational complexity of the algorithm is 0(n ), where n is the number 

of nodes ([3]). This algorithm starts with a feasible flow, e.g. the 

zero-flow, and augments it step by step. 

Now allow upper and lower bounds for the flows f., s.t. 

( < ) b. £ f. ^ a. 
111 

for every arc i. Here the first question is whether there is a feasible 

flow at all. But one can start with the zero flow and apply the algorithm 

mentioned before repeatedly, namely once for increasing a flow f along 

Arc i, given f. < b., in order to get b.^ f. ( < a.), and, at worst, 

once for every i. If the computation becomes stymied no feasible flow 

exists. The overall computation is, at worst, 0(n ). The procedure was 

outlined by Lawler ([8]). 

To get a p-complete problem we allow two flow intervals for an arc i 

such that either d. s f. s: c. or b. < f. s a . for given a., b., c., d ; 

111 111 1111 

a. S: b. > c. > d. . The decisive question is whether there is a feasible 
1111 

flow. This problem is called WF with two flow intervals . 
The problem is obviously in IV IP. We show 
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SUBSET SUM K > NWF with two flow intervals 

Given a set of positive integers S = { s 1 , s , . . . 

and a positive integer s . The Subset Sum Problem : Is there a subset 



s 1 
n ' 



S 1 of S such that £ s = s ? 

r> 1 

ing network : 



For the reduction we use the foil 



ow- 



Source s 




[s] 



-»• Sink t 



There are n arcs from s to r the ith of which allows a flow of value 
or s.. From r to t the flow has to have value s. The Subset Sum Problem 
has a solution iff there is a feasible flow from s to t in the network. 



(2.3) CTP with quadratic optimization function. 

Given a network with sources and sinks. A source may have a limited 
supply, and at each sink a demand has to be satisfied. An arc must carry 
a flow between and some positive upper bound, the capacity of the arc. 
There is a cost c. per unit associated with the flow f. along the 
directed arc i. 

The problem whether there is a flow of size v with a cost not excee- 
ding k, for given v and k, is a wellknown linear programming problem, 
often called the Capacitated Transshipment Problem ( CTP ) . (We assume 
that all the pertinent data are integer valued.) If there is a solution, 
then there is also an integer solution ([5]). The optimization function is 

£ f. c. , 
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where A is the set of arcs of the network. 

Remark: Lawler gives an algorithm for CTP with a computational 

2 
complexity 0(v.n ), where n is the number of nodes in the network ([8]). 

But since we assume that the data including v are given in radix notation, 

the algorithm is not polynomial-time bounded with respect to the input 

length. We do not know whether CTP is in IP . 

In CTP, an arc i that carries a flow of value f . contributes c ^ 

to the cost. Now we allow this cost contribution to be quadratic in f^ 

2 
c„.« f . + c. .• f . + c , 
2i l li l Oii- 
where c,, c and c„. are the coefficients that determine the cost 
2i li Oi 

contribution of arc i. So the optimization function becomes quadratic 

in the f . 's. 

l 

The problem whether there is a flow v of cost zero (for given v) 
is K-p-complete. We call this problem CTP with quadratic optimization 
function . - (We are considering integer data and integer solutions only.) 

To prove this we show the reduction 

SUBSET SUM K *~ CTP with quadratic optimization 

function . 

The problem is obviously in IMF.- Given a set of positive integers 

S = [ s«, s„, ...., s }> 

and a positive integer s. The Subset Sum Problem: Is there a subset S' 

of S, such that £ s. = s ? 

For the reduction we construct the following network: 

[0,s 1 ] 
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There are n arcs between the source and the sink. Along Arc i the flow 
value must be in the interval [0, s.], i = 1, 2, ..., n. 

Consider the cost function 

n 

S f. (s. - f.) . 
i=l X ± L 

The Subset Sum Problem has a solution iff there is a feasible flow of 
size s with cost zero. 



(2.4) Summary. 

Theorem 1 : The following problems are K-p-complete in IN If: 

a) --DF-SAT , 

b) NWF with two flow intervals , 

c) CTP with quadratic optimization function . 
The definitions and proofs are given in (2.1), (2.2), and (2.3). 

(2.5) 3-NODE COVER is p-complete. 

In the Node Cover Problem ( NODE COVER ) we have a graph G and an 
arbitrarily given positive integer k. The question is whether all the 
arcs of G can be covered by at most k nodes (whether there is a node 
cover of size k) . 

Now assume k is a function of the number of nodes n. This is a 
different problem that we denote by NODE C0VER(k(n) ) . Here the decisive 
question is whether there is a node cover of size ik(n)L For example, 
NODE COVER ( ? ) is the problem that asks whether there is a node cover 
of size r ; 1. (We let n always denote the number of nodes in the 
graph.) 
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x for the first x, x for the second x, x for the third x, x, for the 
first x, and x for the second x . - Extend F by 

A (x V X;L ) A (x x V x 2 ) A (x 2 V x 3 ) A (x~ 3 V x ) A (x~ 4 V x ) A (x V x) 
that means 

A ( X 4 Xj -t x. ■• x ■» x, ■» x -t x ) . 
Do this for all variables occuring in F to get F'. Make sure that a new 
variable is introduced for every literal of F. 
Example: 

F = (x V y V z) A (x V y V z) A (x V y V z) . 
F' = (x x V y x V z 1 ) A (x 2 V y 2 V zj A (x 3 V y g V zj 
A (x V Xl ) A (x x V x 2 ) A (x V x 3 > A (x V x) 
A (y V Yi ) A (y 1 V y 2 ) A (y 2 V y 3 > A (y g V y) 
A (z V Zi ) A (z x V z 2 ) A (z 2 V z 3 ) A (z g V z) . 
Assume F has 3p literals and m different propositional variables. 
Then, F' has 

3p + (2-3p + 2m) = 9p + 2m 
literals each of which is incompatible with at most three others. Hence, 
if G' is the corresponding graph of F 1 , we have G' G (£, . 

Obviously, F is satisfiable iff F' is satisfiable. F' is satis- 
fiable iff G' has a node cover of size 2p + (3p + m) . This follows 
immediately from the structure of F' and from the following reductions 
as given by Karp ([7]): 

CF-SAT K > CLIQUE K >■ NODE COVER . 

(2.6) NODE COVER ( cm ) is p-complete. 

We refer to the definitions given in (2.5). 
From Karp's reductions 
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CF-SAT K y CLIQUE K > NODE COVER 

it follows immediately that NODE COVER (2n/3 ) is p-complete. The 
following theorem is a generalisation thereof. 

Theorem 3 : For any fixed rational a s (0, 1), 

NODE COVER ( cm ) is K-p-complete in Ik/?. 
Proof of Theorem 3: We are going to show 

3-CF-SAT K *- NODE COVER ( era ) . 

Let F be a propositional formula in conjunctive form with exactly 
three literals per clause. Assume F has p clauses. Let G be the corre- 
sponding graph of F. G has n = 3p nodes. 

We know from Karp's reductions that F is satisfiable iff G has a 

node cover of size 2n/3 . 

2 
First, assume a e (0, -) . We would like to reduce NODE COVER (2n/3) 

to NODE COVER ( cm ) . For that, add q isolated nodes to G; call the 

new graph G-. . Then, G has 3p + q.. nodes. There is a node cover of 

2p nodes for G iff there is a node cover of 2p nodes for G . We want 

T cf.(3p + q x ) 1 = 2p , or 
2 



T or.(n + q x ) 1 = ^n . 



According to this equation, q, is essentially a linear function of n. 

2 
Now assume a e ( -, 1). G has n = 3p nodes. Add a complete graph 

of q nodes to G; call the extended graph G . G has a node cover of 
2p nodes iff G„ has a node cover of 2p + q - 1 nodes. We want 

T c*.(3p + q 2 ) 1 = 2p + q 2 - 1 , or 

T q--( n + q„) 1 = ;ii + q„ - 1 . 
Obviously, q„ is essentially a linear function of n . 
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(2.7) NODE COVER (^/n" 1 ) is p-complete. 

With respect to Theorem 3 one may ask for further generalisations. 
Our next theorem gives one answer. 

Theorem 4 : Given a fixed integer m greater than 1. 

a) NODE COVER (Vn 1 ) is K-p-complete. 

b) NODE COVER ( n - —AT 1 ) is K-p-complete. 

Proof^ For example, we can reduce NODE COVER (n/2) to the problems 
given in a) and b) . Then the proof is analogous to the proof of Theorem 
3, i.e. we add isolated nodes or complete graphs to the given graph 
to get the desired ratio between n and the size of the node cover. 

To carry out the reductions, we start with a graph G which has n 
nodes. Assume n is even ( otherwise add an isolated node ). 

In Case a) we add q^ isolated nodes to G and call the new graph G . 
G^ has n + q^ nodes. G has a node cover of size n/2 iff G.. has a 

node cover of size n/2. Thus we want 

n _ m / ! 

2 - ~V n + q 1 • 

In Case b) we add a complete graph with q nodes to G and call the 
new graph G^. G 2 has n + q nodes. G has a node cover of size n/2 iff 

G 2 has a node cover of size = + q - 1 . Thus we want 

n i -i , m / ; ' 

2 + q 2 -l = n+q 2 - ~V n + q 2 , or 

n i i m / > 

2 + l = ~^/ n + q 2 

5f™?ElSi Since both q and q are 0( n™) , the reduction space 

itself is 0( n ). We do not know better reductions and conjecture 

that the reduction space for any reduction 

NODE COVER ( n/2 ) K 9- NODE COVER ( A/V ) 

is 0( n ) . 
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the problem is whether there is a subset of k columns that covers all 
rows. - One could formulate dual problems in the same way for SET 
PACKING and SET COVERING. 

(2.9) 3-CHROMATIC NUMBER is p-complete. 

The problem 3-CHROMATIC NUMBER asks whether a given graph is 
three-colorable. It was shown by Larry Stockmeyer that this problem 
and the contraction restricting it to planar graphs are p-complete 
([12]). 
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CHAPTER 3: ON REDUCTIONS AMONG K-p-COMPLETE PROBLEMS. 



(3.0) In this chapter we report some efforts made to investigate the 
computational complexity of the reductions among p-complete problems. 
A motivation is given in (3.1). In Sections (3.2) and (3.3) we give 

a few observations on encoding and space complexity. The rest of 
the chapter exhibits a few new reductions we found among known p-com- 
plete problems. 

(3. 1) By Cook's Reduction Theorem ([1]) every problem solvable in 
nondeterministic polynomial time is K-reducible in polynomial time 
to SAT. The reduction time (i.e. the polynomial ) depends on the 
problem, i.e. on the special nondeterministic T.M. and thus also on 
the encoding. 

Let II be a problem having deterministic time lower bound n . 

Assume, for example, there is a reduction II K >■ SAT that has 

reduction time t (n) and reduction space s (n). t and s are 
r r r r 

polynomials in n. Assume SAT has (deterministic) time upper bound 
T(n). Then, 

t r (n) + T(s r (n)) 
is an upper bound on the deterministic time complexity of IT . 

If degree t (n) < r , then t(s (n)) must be at least 0(n ). 
Hence, t must be at least 0( n e 8( s r>) # xhi s observation may 
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allow us to prove large polynomial lower bounds on complete problems 
such as SAT even if we are unable to prove that SAT % F 

Therefore we were looking for problems with deterministic poly- 
nomial time lower bounds of high degree that can be reduced to some 
p-complete problem, e.g. to SAT, easily, at least such that it takes 
space bounded by a lower degree polynomial. We did not achieve what 
we wanted but became interested in the computational complexity of 
the reductions. 

(3f 2 ) Time and space complexity of a reduction depends on encodings 
of problems as well as on the reduction device. Here we use one-tape 
T.M. 's. 

The objects of the known p-complete problems are propositional 
formulas, graphs, graphs with weighted arcs, finite sets of integer 
equations, or sets of subsets of a finite domain. For each of these 
problems we have two or three encodings that we would like to call 
standard encodings. We are going to outline what kind of encodings 
we have in mind. 

Assume we have in each case some convenient finite alphabet E. 

When we distinguish items of the same type, like variables or nodes, 

we use indexing and binary notation. For example, we could represent 

five nodes of a graph by NO, Nl, N10, Nil, N100, where N e £ . 

Integers are represented in binary notation. 

In case of propositional formulas we think about the following 
encodings : 

clause-by-clause, where essentially every clause is a list of 
the literals occurring in it; 
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the incidence matrix given row by row, consisting of O's and 

l's, one row for each clause and one double column for each 
variable ( a double column because the variable may occur 
affirmative or negative); 
an encoding like the previous one, but O's, +l's and -l's in the 
matrix and a single column for each variable, using +1 for 
an affirmative and -1 for a negative occurence of a variable. 
In case of graphs we think about the encodings: 
a list of all nodes followed by a list of all arcs, an arc given 

as a pair of nodes; 
the incidence matrix given row by row using O's and l's, rows 

for the nodes and columns for the arcs; 
the adjacency matrix given row by row. 

In case of subsets of a finite domain we think about the encodings: 
a list of all subsets, where every subset is a list of the elements 

occuring in it; 
the incidence matrix given row by row, a row for every subset 

and a column for every variable in the domain. 
For a set of equations we think about the encodings : 
equat ion-by- equat ion ; 
the coefficient matrix of the system of equations, given row 

by row. 
These are essentially the encodings we refer to when we use the 
term 'standard_encoding ' . 

(3.3) The reduction space of the reductions within the family of 
p-complete problems was considered in some detail. One can find standard 
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p-complete problems given in some standard encoding, v is not linear 

(on a one-tape T.M. ), but in many cases quadratic in the input length 

3 

n, sometimes even worse but not exceeding 0(n ). 



§2™ e _2 e ^_ES^££i22S_§22DS_^B2Y2_E" com E-'- ete 2 r °blems 

(3.4) 3-CF-SAT K > EXACT COVER 

Given a propositional formula F in conjunctive form with exactly 
three literals per clause. Assume F has m clauses and r variables occur 
( uncomplemented or complemented ). Let C. be the clauses, i = 1, 2, .. 
.., m , and x. be the variables, j = 1, 2, ..., r. Form the following 
array: 





X l X l 


X 2 ( X 2 


X 3 


' X 3 




X ' X 

r i r 




c l 


X ' 


1 
1 x 
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i 
l 
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i x 


1 
I 




i 




i 
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C 3 


X 1 
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1 

i 




l 
X 1 
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• 








i 




1 

1 


ft 


■ 








i 




1 
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t 




1 




i 




1 


» 


C 

m 




! 

1 




i 

I X 
1 




1 

1 


X 




X 


X 


X 




x . 





There is a row for every clause C and a double column for every 
pair (x., x.). A cross X occurs in Row C. and Column x (or x~ ) 
iff in Clause i the literal x. (or x.) occurs. In addition, we have 
one cross for every row in an additional column at the right hand 
side, and one cross for every double column at the bottom of the array. 

The domain of the Exact Cover Problem is the set S of all these 
crosses, i.e. each cross is an element in S. We form a system of subsets 
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3m :£ 2 r . This shows that our reduction w.r.t. the encodings 
outlined above is space-linear. 

2. A space-linear reduction SAT K y 3-CF-SAT was given by 

Fischer, Meyer and Paterson ([4]). Hence we have a space-linear 
reduction SAT K — *■ EXAGT COVER. 

3. It was shown by Michael J. Fischer that our reduction 

3-CF-SAT K ► EXACT COVER can be modified to a reduction 

CF-SAT K — > EXACT COVER that is quadratic in space. 

(3.5) CF-SAT K *- directed HC + ). 

Karp ([7]) gives a reduction NODE COVER K — *> directed HC 
that is rather complicated and quadratic in space. The reduction we 
are going to describe here is linear in space. 

Given a propositional formula F in conjunctive form. Assume F 
has m clauses C^, i = 1, 2, ..., m, and r variables x. occur in F, 
J = 1» 2, ..., r. - Essentially we use the same array as the one used 
in (3.4). We have a row i for every clause C. and columns for x- , x 1 , 
* 2 » x 2» •**» x r ' x r * We draw a box in a column in Row i iff 
the literal corresponding to this column occurs in C . . In addition we 

draw a cross A. below and a cross B. above each double column i . 
J j J * 

j = 1, 2, ..., r, and a box jij for each row i, i = 1, 2, ..., m, 
on the right hand side of the array. 

Examgle: F = C A C A C, A C, , 

C Y = x L V x 2 V x 5 , C 2 = x t V x 3 V x 4 , 
C 3 = x 2 V x V x 5 , C, = x_ V x, V x 5 . 



) HC = Hamiltonian Circuit. 
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From the array we construct a graph G. The crosses will become nodes, 

and the boxes g and Q are going to be subgraphs of G. 

Each subgraph jxj is isomorphic to the following one having three 
nodes : 



t 



3 o< 

1 

2 5 

t 



< ol 

1 

There is one chain of arcs upwards along each column of the array, 
entering the subgraphs § at 1 and leaving them at 3. For a double 
column the two chains start at A. on the bottom and end at B on the 
top. Bj is connected to A , j = 1, 2, ..., r-1, and B r to [j] by 
arcs. 

There are arcs from fij to the jxj 's of Row i entering an (xj 
always at its Node 3, and we have an arc from each jxj of Row i to 
HJ leaving the jxj at its Node 1. The JiJ 's are described below, 
but each [ij has - among other nodes - an entrance node P. and an exit 
node Q ± . B^ is connected to P , Q ± to P for i = 1, 2, ..., m-1, and 
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<* m t0 A 1 ' 
m 1 

The idea of the construction is the following. A HC must go 

through A- and then follow either Column x.. or Column x.. up to B... 

Observe that the structure of the subgraphs [x) and the way they are 

connected with each other along the columns forces the path to proceed 

along one and the same column after A. is passed until B. is reached. 

J J 

Depending on the truth assignment, we choose Column x.. if x.. is true, 
and Column x.. if x is false. From B 1 we come to A and go up Column x„ 
if x~ is true and up Column x if x„ is false, and so on. 

After the path has reached B it must pick up all the nodes in 
the [jj 's, starting at [l] , then going to j2j , and so on, until jr] 
is reached. While picking up the nodes of ji] , the rest of the nodes in 
Row i must be picked up. Given there are r. literals in Clause i, 

jij is constructed in such a way that from it at the most r. - 1 [x] 's 
can be picked up. 

As far as the [i] 's are concerned, our original solution was for 
three literals per clause only. It has been generalized and substanti- 
ally improved by Michael J. Fischer and Joel Seiferas. They gave the 
following elegant solution for what the subgraph [i] has to perform. 
For example, assume we have 5 literals in Clause i, i.e. 5 [x] 's 
in Row i. These [x] 's are connected with fij as follows: (See the 

figure on the next page.) 
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has nodes P., Q., and R , j = 1, 2, 3, 4, 5. It is easy to 
convince oneself that a path from P. to Q can pick up all the R..'s 
and an arbitrary groger subset of the 's, but never all the |xj 's. 
The generalisation for any number of literals per clause is obvious. 
Thus we have shown: F is satis fiable iff G has a HC. 



(3.6) Undirected HC K »- SAT. 

We describe a very simple reduction requiring only linear space. 
This reduction is best described by an example. 

Example: 



Graph G: 




G has nodes N. and edges E. 
i 3 



i = 1, 2, 3, 4, 5; 

j = 1, 2, 3, 4, 5, 6, 7. 
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Consider Node N.. . A HC must include 

E n and E_ and not E, 

15 6 

or E n and E, and not E_ 

16 5 

or E s and E, and not E.. . 

Therefore we associate a Boolean variable x. with Edge E., j = 1, 2, 3, 

J 1 

4, 5, 6, 7, and write the following five formulas: 
F, : x n x,x c V x,x c x, V x_x-x, 

i 1 O D 1 D O D O I 

F 2 : Xl x 2 



F 3 : X 2 X 3 X 6 X 7 V X 2 X 6 X 3 X 7 V X 2 X 7 X 3 X 6 V X 3 X 6 X 2 X 7 V X 3 X 7 X 2 X 6 



V x 6 x 7 x 2 x 3 



F 4 : X 3 X 4 

F 5 : x 4 x 5 x ? V x 4 x 7 x 5 V XgX^ . 

It is easy to see that 

F = F l A F 2 A F 3 A F 4 A F 5 
Is satisfiable iff G has a HC. 

Remark: The reduction can be changed easily to the reduction 

directed HC K p- SAT . 
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